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Summary:  Integration  and  test  of  the  GEMINI  instrument  package.  Most  of  the 
work  performed  under  this  task  supported  development  of  the  GEneralized  Multi-wavelength 
INfrared  Imager  (GEMINI)  for  the  Maui  Space  Surveillance  Site.  GEMINI,  not  to  be  con¬ 
fused  with  the  National  Science  Foundation's  Gemini  Telescopes  Project,  is  a  one-of-a-kind 
sensor  package  built  for  USAF  Space  Command  operational  use  in  conjunction  with  its 
space  object  identification  (SOI)  mission.  GEMINI  hosts  a  variety  of  sensors  for  use  with 
several  algorithms  for  reconstructing  high-resolution  satellite  and  astronomical  images  from 
data  degraded  by  atmospheric  turbulence. 

The  first  GEMINI  operating  mode  made  available  to  Space  Command  delivers  high- 
resolution  images  using  a  speckle  imaging  technique.  Schafer  contributed  to  the  project  by 
implementing  and  developing  parallel  speckle  imaging  recovery  and  calibration  algorithms 
and  by  designing  computer  simulations  to  model  real-world  detector  effects  and  predict  avail¬ 
able  resolution.  Schafer  also  conducted  an  observational  evaluation  of  the  speckle  imaging 
algorithm. 

A  distinctive  feature  of  the  GEMINI  data  acquisition/reduction  system  is  the  provision 
for  near  real-time  (NRT)  video  feedback  of  reconstructed  imagery  to  the  GEMINI  operator. 
NRT  feedback  allows  the  operator  to  select  camera  exposure  times  and  optical  filters  based  on 
the  visual  quality  of  reconstructed  data.  These  operating  parameters  can  thus  be  optimized 
u'hik  data  is  being  acquired,  potentially  eliminating  the  need  for  subsequent  observations 
of  the  same  object.  For  example,  raw  image  data  of  a  bright  satellite  can  yield  a  poor 
reconstruction  if  most  of  the  light  is  concentrated  in  a  strong,  localized  glint.  Selection  of 
a  different  filter  may  reduce  the  glint  effect  in  the  reconstruction,  which  would  be  apparent 
to  the  operator  using  an  NRT  imaging  system.  Also,  NRT  reconstruction  can  be  used  to 
reduce  even  large  amounts  of  data  is  a  timely  manner,  allowing  fast  turnaround  between 
observing  and  imaging  without  simply  ignoring  most  of  the  data  (as  was  commonly  done  at 
the  MSSS  prior  to  GEMINI). 
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The  speed  required  for  NRT  imaging  was  obtained  with  a  parallel  implementation  of 
speckle  imaging  reconstruction  and  calibration  algorithms.  The  code  is  executed  on  the 
MHPCC's  IBM  SP2  supercomputer.  Schafer  was  responsible  for  design,  parallelization,  and 
optimization  of  all  image  reconstruction  and  calibration  software,  as  well  as  joint  develop¬ 
ment  of  the  entire  data  acquisition  system  with  a  multi-contractor  team. 

During  the  period  March-Nov  1998,  Schafer  participated  in  laboratory  and  operational 
testing  of  GEMINI  hardware  and  softare.  The  data  acquisition  and  processing  pipeline 
was  a  multi-contractor  team  effort,  with  Schafer  responsible  for  parallel  (NRT)  execution 
of  the  speckle  imaging  algorithm.  This  software,  along  with  Schafer-developed  calibration 
algorithms,  was  tested  with  lab  data  and  telescope  data,  once  GEMINI  was  installed  on  the 
MSSS  1.6-m  telescope.  Schafer  was  also  responsible  for  evaluating  GEMINI  resolution  under 
a  variety  of  imaging  conditions.  Sample  “first  light’’  results,  of  a  binary  star  and  the  orbiting 
Hubble  Space  Telescope,  are  shown  below.  Figure  1  shows  a  surface  plot  of  uncalibrated 
(“raw”)  data  from  an  observation  of  the  binary  star  ADS  10598.  Note  that  due  to  the 
corrupting  effects  of  the  Earth’s  atmosphere,  the  intensity  distribution  bears  no  discernable 
resemblance  to  that  of  a  binary  star.  In  Fig.  2,  where  the  result  of  processing  the  data  with 
the  Schafer  implentation  of  the  speckle  imaging  algorithm  is  shown,  the  two  components  of, 
the  binary  are  clearly  visible  and  resolution  is  nearly  diffraction-limited.  Figure  3  shows  a 
processed  image  of  the  Hubble  Space  Telescope.  Note  the  aperture  cover  is  clearly  visible 
in  the  “open”  position,  and  the  slight  “twist”  in  the  Hubble’s  solar  panels,  seen  in  photos 
taken  by  space  shuttle  crews,  is  readily  apparent. 
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Fig.  2:  Bispectrum  speckle  imaging  recovery  of  ADS10598. 
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Fig.  3:  GEMINI  image  of  the  Hubble  Space  Telescope 


Summary.  Noise  and  information  transport  in  imagery.  Previous  work  by  Schafer 
has  demonstrated  a  practical  noise-removal  scheme  utilizing  projections  onto  convex  sets. 
During  March-November  1998.  Schafer  developed  a  code  to  calculate  the  information  content 
of  imagery  and  evaluate  the  effect  of  the  convex  projections  scheme  on  the  two-dimensional 
information  distribution  of  processed  image  data.  The  code  was  used  to  validate  an  analyt¬ 
ical  treatment  of  the  noise  and  information  “transport’'  phenomenon  previously  observed. 
Details  are  in  the  attached  paper  by  Prasad  and  Tyler. 

Summary.  A  wavefront  sensor-driven  variable  geometry  pupil  for  ground-based 
aperture  synthesis  imaging.  Schafer  developed  the  concept  of  a  variable  geometry  pupil 
to  increase  image  resolution  for  ground-based  near-IR  and  optical  telescope  imaging.  In  this 
scheme,  a  curvature- type  wavefront  sensor  provides  an  estimate  of  the  wavefront  curvature  to 
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the  controller  of  a  high-resolution  spatial  light  modulator  (SLM)  or  micro-electromechanical 
(MEM)  mirror,  positioned  at  an  image  of  the  telescope  pupil.  This  optical  element,  the 
VGP,  passes  or  reflects  the  incident  beam  only  where  the  wavefront  phase  is  sufficiently 
smooth,  viz.,  where  the  curvature  is  sufficiently  low.  Using  a  computer  simulation.  Schafer 
showed  the  VGP  can  sharpen  and  smooth  the  long-exposure  PSF  and  increase  the  OTF 
SNR  for  tilt-only  and  low-order  AO  systems,  allowing  higher  resolution  and  more  stable 
deconvolution  with  dimmer  AO  guidestars.  Details  are  in  the  attached  paper  by  Tyler  and 
Stribling. 
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A  wavefront  sensor-driven  variable  geometry  pupil 
for  ground-based  aperture  synthesis  imaging 
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ABSTRACT 

We  describe  a  variable-geometry  pupil  (VGP)  to  increase  image  resolution  for  ground-based  near-IR  and  optical 
imaging.  In  our  scheme,  a  curvature-type  wavefront  sensor  provides  an  estimate  of  the  wavefront  curvature  to  the 
controller  of  a  high-resolution  spatial  light  modulator  (SLM)  or  micro-electromechanical  (MEM)  mirror,  positioned 
at  an  image  of  the  telescope  pupil.  This  optical  element,  the  VGP,  passes  or  reflects  the  incident  beam  only  where 
the  wavefront  phase  is  sufficiently  smooth,  viz.,  where  the  curvature  is  sufficiently  low.  Using  a  computer  simulation, 
we  show  the  VGP  can  sharpen  and  smooth  the  long-exposure  PSF  and  increase  the  OTF  SNR  for  tilt-only  and 
low-order  AO  systems,  allowing  higher  resolution  and  more  stable  deconvolution  with  dimrher  AO  guidestars. 

Keywords:  Adaptive  optics.  Aperture  synthesis.  Optical  instrumentation 

1.  INTRODUCTION 

Time- varying,  spatially  random  refractive  index  fluctuations  in  Earth’s  atmosphere  limit  the  resolution  of  even  the 
largest  ground-based  telescopes  to  that  obtainable  by  a  telescope  of  diameter  equal  to  the  Fried  seeing  parameter, 
I'o.  This  limitation  can  be  understood  by  considering  the  effects  of  pupil  redundancy  in  imaging  while  viewing  image 
formation  itself  as  an  interferometric  process.*  In  the  interferometric  model,  the  pupil  of  an  Earth-based  telescope 
is  seen  as  an  ensemble  of  r„-size  ‘'apertures’"  or  patches,  each  pair  of  which  forms  an  interference  pattern  on  the 
detector.  The  fringe  patterns  have  a  variety  of  orientations  and  modulation  frequencies  depending  on  the  orientation 
ami  .separation  of  these  patches,  and  their  incoherent  sum  is  the  detected  image.  In  an  unobscured  telescope  pupil  in 
till’  [jre.sence  of  atmospheric  turbulence,  there  may  be  many  r„  patches  separated  by  identical  distances  and  having 
the  same  orientation.  This  means  the  fringe  pattern  in  a  given  direction  on  the  detector  with  a  given  frequency  is 
actually  the  sum  of  many  individual  redundant  fringe  patterns.  Due  to  turbulence,  the  phase  difference  between  any 
two  redundant  fringe  patterns  is  a  random  variable,  reducing  the  visibility  of  the  sum  fringe  pattern  and  limiting 
the  resolution  of  the  detected  image.  Use  of  adaptive  optics  (AO)  increases  resolution  by  reducing  the  relative  shift 
of  the  individual  fringe  patterns  and  increasing  the  spatial  coherence  of  the  field  incident  at  the  telescope  pupil, 
thus  increasing  the  visibility  of  the  sum  pattern.-  Another,  less  complicated  approach  is  to  limit  redundancy  by 
masking  selected  portions  of  the  telescope  pupil.  Several  relatively  small  openings  can  be  used  to  synthesize  a 
large,  high-resolution  telescope.  While  the  phases  of  the  resulting  fringe  patterns  are  still  randomized  by  turbulence, 
the  resulting  increase  in  visibility  amplitude  SNR  can  be  used  to  enhance  phase  recovery  by  such  methods  such 
as  speckle  imaging  or  self-calibration.®  Of  course,  masking  the  light-gathering  area  of  the  telescope  decreases  the 
available  signal  and  passes  a  reduced  set  of  spatial  frequencies.  This  technique  is  known  variously  as  “optical  aperture 
synthesis.""  ‘'pupir"  or  “aperture  masking.”  “non-redundant  masking”,  and  “masked-aperture  speckle  imaging.”  We 
use  the  nomenclature  “partially-redundant  masking”  (PRM)  as  a  general  description  of  the  several  variations  on  the 
above  theme. 

Since  the  first  astronomical  images  reconstructed  using  aperture  masking^  were  reported  in  1988,  considerable 
effort  has  been  expended  by  the  astronomical  community  to  quantify  the  viability  of  this  technique,®”*®  especially 
in  comparison  wnth  adaptive  optics  systems.**  Initial  studies  and  experiments  were  done  using  masks  with  a  fixed 
(small)  number  of  r^-size  openings.  To  obtain  reasonable  u-v  coverage  with  a  small  number  of  apertures,  researchers 
applied  several  rotations  of  the  mask  relative  to  the  incident  beam.  More  recently,  workers  employed  more  novel 
geometries,  such  as  an  r„-size  annulus®  *®  or  a  slit.*®  These  geometries  are  not  actually  “non-redundant,”  hence  the 
use  of  “PRM.”  Generally.  AO  is  considered  to  be  the  more  robust  of  the  two  methods,  allowing  deeper  imaging  at 


higher  resolution.  As  a  consequence,  there  has  been  some  interest  in  combining  the  techniques. In  this  paper,  we 
describe  an  adaptive  pupil  mask  for  use  with  AO,  driven  by  the  AO  wavefront  sensor.  As  we  will  show,  the  concept 
has  potential  to  augment  the  resolution  provided  by  conventional  AO. 

The  variable-geometry  pupil  (VGP)  we  propose  here  is  a  new  type  of  adaptive  optical  element,  correcting  the 
amplitude  of  complex  visibilities  rather  than  the  phase,  and  realizing  PRM  concepts  in  an  adaptive,  controllable 
way.  Such  an  element  might  be  a  high-resolution  spatial  light  modulator  (SLM)  or  a  micro-electromechanical  (MEM) 
mirror.  The  VGP  would  be  placed  between  the  AO  deformable  mirror  and  the  science  instrument  and  controlled  in' 
such  a  way  that  only  portions  of  the  incident  beam  with  sufficiently  small  phase  variation  would  be  passed  (SLM) 
or  reflected  (MEM)  to  the  science  instrument.  Since  we  envision  the  VGP  would  be  used  with  AO.  an  estimate 
of  the  wavefront  phase  is  readily  available  from  the  AO  wavefront  sensor  (WES).  In  fact,  if  the  AO  system  uses  a 
curvature-sensing  WFS.  the  WFS  provides  the  information  on  phase  “smoothness’"  directly  to  the  VGP  controller. 
In  our  scheme,  the  WFS  would  drive  the  AO  system  in  a  closed  control  loop  while  providing  “go-to"  curvature 
signals  to  the  VGP;  that  is.  there  would  be  no  feedback  between  the  VGP  and  the  WFS.  The  VGP  would  thus 
receive  commands  at  intervals  on  the  order  of  the  inverse  AO  bandwidth,  passing  or  reflecting  light  in  response 
to  the  random  atmospheric  phase  perturbations  and  the  AO  system  corrections.  Since  the  phase  of  the  wavefront 
incident  at  the  WFS  and  VGP  (modulo  noncommon  optical  path  errors)  is  a  random  process,  the  shape  of  the  VGP 
would  also  change  randomly,  potentially  providing  full  u-v  coverage  after  a  few  WFS  commands.  The  VGP.  as  we 
envision  it.  would  overcome  many  of  the  limitations  in  the  fixed-pupil  PRM  concept;  for  example,  the  VGP  can  be 
adjusted  for  seeing  and  object  brightness  conditions  without  modifying  or  replacing  the  mask,  and  there  is  no  need 
for  beam  or  mask  rotation.  We  note  that  the  concept  of  selectively  masking  the  telescope  pupil  during  AO  operation 
is  also  discussed  in  Morossi.  ei  in  these  proceedings. 

In  the  following  section,  we  describe  the  computer  simulation  methods  used  to  model  the  telescope.  AO.  and 
VGP.  In  Section  3.  we  present  simulation  results,  showing  PSFs  and  OTF  SNR  plots  with  and  without  the  VGP  for 
various  AO  systems.  We  summarize  our  work  in  Section  4. 

2.  COMPUTER  SIMULATION  AND  MODELING 

We  model  incoherent  imaging  with  a  3.6-m  telescope  and  two  AO  systems.  The  computer  simulation  we  use  generates 
independent  random  realizations  of  Kolmogorov  turbulence.  The  AO  subroutines  actually  model  a  Shack-Hartmann 
WFS  rather  than  a  curvature  sensor,  but  we  calculate  curvature  separately  after  generating  the  post-AO  phase  map. 
For  the  simulation  results  shown  here,  both  the  AO  and  VGP  are  driven  by  “go-to”  commands  in  open  loop. 

After  AO  compensation,  the  wavefront  curvature  is  calculated  from 

C(r)  =  :F-»{p2$(p)}.  (1) 

where  (T'lr)  is  the  curvature  map  at  pupil  coordinate  r.  $(p)  is  the  Fourier  transform  of  the  pupil  phase  map  0(r). 
and  p  —  1p|.  Since  the  above  operations  are  done  on  the  same  grid  as  the  array  representation  of  the  C(r) 
varies  on  an  unrealistically  small  scale.  To  model  the  finite  sampling  resolution  of  an  actual  curvature  WFS.  we 
calculate  C  from  C  by  smoothing  with  a  Gaussian  filter  having  a  FWHM  corresponding  to  the  subaperture  size  used 
in  the  sim.  The  VGP  pupil  function  is  formed  by  setting  the  telescope  pupil  function  to  zero  wherever  C  exceeds  a 
threshold  value.  A  V’GP  pupil  function,  typical  for  the  conditions  described  in  the  next  section,  is  shown  in  Fig.  1. 
Subsequent  to  the  calculation  of  the  VGP  pupil  function,  a  generalized  pupil  function  is  formed  by  multiplying  the 
phase  screen  and  VGP  pupil  function.  PSF  images  are  formed  from  the  squared  modulus  of  the  Fourier  transformed 
generalized  pupil  function. 


3.  SIMULATION  RESULTS 

Our  baseline  imaging  model  consists  of  a  3.6-m  telescope  and  a  37-actuator  AO  system.  This  configuration  yields  an 
actuator  separation  of  55  cm.  The  Shack-Hartmann  WFS  model  has  30.  48-cm  subapertures.  In  all  our  simulations, 
we  use  a  WFS  phase  measurement  standard  deviation  of  0.18  radians,  corresponding  to  a  guidestar  magnitude  of 
about  6  with  a  sampling  rate  of  1000  Hz.  This  small  error  means  WFS  noise  is  not  the  limiting  factor  in  image 
resolution,  although  we  choose  seeing  conditions  of  r^,  =  20  cm  at  our  imaging  wavelength  of  750  nm  that  are 
somewhat  demanding  for  the  AO  system  (D/j-j,  =  18).  To  avoid  excessive  WFS  aliasing,  we  model  wavefront  sensing 
at  H  band,  where  the  effective  seeing  parameter  is  ss  50  cm.  The  detector  pixels  are  critically  sampling,  with  each 


pixel  subtending  about  26  milli-arcsec.  In  these  seeing  conditions,  the  AO  performance  is  partially-compensating 
due  to  fitting  error,  allowing  for  some  resolution  gain  using  the  VGP.  but  not  unrealistically  degraded. 


Fig.  1:  Sample  \''GP  pupil  function.  White  indicates  low-curvature  regions.  Note  the  telescope  secondary 
obscuration  is  not  readily  apparent  because  it  is  contiguous  with  two  regions  of  high  wavefront 
curvature,  which  are  masked  by  this  realization  of  the  VGP  pupil  function. 


\’GP  operation  is  illustrated  in  Fig.  2.  The  top  part  shows  a  surface  plot  of  the  DM-only  generalized  pupil 
function.  .Notice  the  vertical  scaling,  indicating  the  wavefront  phase,  extends  from  -6  to  -)-6  radians.  While  the 
D.M-only  AO  has  corrected  much  of  the  phase  from  the  D/r„  =  18  seeing,  the  residual  is  significant.  In  the  bottom 
of  Fig.  2.  we  show  a  surface  plot  of  the  VGP/DM  generalized  pupil  function.  Note  the  scaling  now  extends  from  -4- 
to  -f-  4.  and  regions  with  large  phase  excursions  in  the  top  part  of  Fig.  2  have  been  blocked  by  the  VGP. 

In  Fig.  3.  we  show  long-exposure  log  PSFs  using  the  30-subaperture  AO  system  with  and  without  the  VGP. 
The  PSFs  were  formed  by  averaging  the  images  from  300  independent  turbulence  realizations.  No  detector  noise  is 
modeled  for  these  plots,  since  we  desire  to  see  potential  PSF  improvements  prior  to  noise  modeling.  To  distinguish 
between  the  two  types  of  AO.  we  refer  to  conventional  adaptive  optics  without  the  VGP  as  “DM-only’’  AO.  The  use  of 
both  the  DM  and  VGP  is  called  “VGP/DM  AO.’’  Note  the  VGP/DM  PSF  has  more  energy  in  the  diffraction-limited 
core  and  smaller  halo  features. 

If  we  improve  the  DM  correction  by  adding  more  subapertures  and  actuators,  the  improvement  using  the  VGP 
is  not  so  pronounced,  as  we  expected.  In  Fig.  4.  we  show  the  same  PSFs  for  AO  with  40  subapertures  and  45 
actuators.  Although  the  DM-only  PSF  has  improved  considerably,  the  VGP/DM  PSF  still  has  more  core  energy  and 
smaller  sidebands.  Conversely,  degrading  the  AO  performance  causes  a  dramatic  increase  in  the  relative  performance 
of  VGP/DM  AO  over  DM-only  AO.  Fig.  5  shows  PSFs  for  a  30-subaperture  WFS  driving  a  fast-steering  mirror 
(FSM)  for  tilt  removal  and  a  VGP/FSM  combination.  In  this  case,  the  improvement  in  core  energy  using  the  VGP 
is  nearly  an  order  of  magnitude:  in  fact,  there  is  no  discernible  core  without  the  VGP. 

The  implications  of  these  improvements  are  illustrated  in  Fig.  6:  Here,  we  have  plotted  the  DM-only  PSFs  for 
the  30  and  40-subaperture  systems  along  with  the  VGP  PSF  for  the  30-subaperture  system.  Note  the  VGP  PSF 
with  the  30-subaperture  AO  has  nearly  the  same  core  energy  and  lower  sidelobes  than  the  40-subaperture  DM-only 
PSF.  Potentially,  then,  the  VGP  can  extend  the  resolution  obtained  with  DM-only  AO  to  the  dimmer  guidestars 
allowed  with  fewer,  larger  subapertures. 


Fig.  *2:  DM-only  (top)  and  VGP/DM  (bottom)  generalized  pupil  functions. 


Fig.  3:  Long-exposure  log  PSF  cross-sections  for  a  30-subaperture  WFS  driving  a  37-actuator  DM  only 
(solid  line)  and  a  VGP/DM  combination  (dashed  line). 


Fig.  4:  Long-exposure  log  PSF  cross-sections  for  a  40-subaperture  WFS  driving  a  45-actuator  DM  only 
(solid  line)  and  a  VGP/DM  combination  (dashed  line). 


Fig.  5:  Long-exposure  log  PSFs  for  a  30-subaperture  WFS  driving  a  tip-tilt  FSM  only  (solid  line)  and  a 
VGP/FSM  combination  (dashed  line). 


field  ongle  (orcsec) 


Fig.  6:  Long-exposure  log  PSF  cross-sections  for  a  30-subaperture  WFS  driving  a  37-actuator  DM  only  (solid 
line),  a  40-subaperture  WFS  driving  a  45-actuator  DM  only  (dashed  line),  and  a  30-subaperture  WFS 
driving  VGP/DM  combination  (dot-dashed  line). 


While  improved  PSF  morphology  is  desirable,  the  key  to  successful  deconvolution  stability  is  the  OTF  SNR.  To 
assess  the  relative  SNR  performance  of  VGP/DM  and  DM-only  AO^  we  included  detector  noise  in  the  image  formation 
process  and  calculated  sample  SNRs  over  a  300-frame  ensemble.  For  these  simulations,  we  included  Poisson  (signal- 
dependent)  noise  from  a  double-precisian  "random  number  generator  and  zero-mean  Gaussian  (additive)  “read"  noise 
with  an  RMS  amplitude  of  7  e~  per  pixel  per  read. 

In  Fig.  7.  we  show  the  OTF  SNR  for  a  magnitude  10  object.  The  signal  level  to  the  WFS  is  unchanged,  since 
we  desire  only  to  study  the  effect  on  the  OTF  SNR  of  reducing  the  signal  level  with  the  VGP.  The  2-d  SNR  map 
has  been  radially  averaged  to  produce  a  simpler  x-y  plot.  Note  the  VGP/DM  SNR  is  significantly  higher  than  the 
DM-only  SNR  for  all  frequencies  beyond  the  seeing  spike.  The  relative  reduction  in  VGP/DM  SNR  near  DC  is 
due  to  the  increased  fluctuation  in  total.intensity  as  the  VGP  aperture  area  changes  randomly;  the  only  total  count 
variance  for  DM-only  AO  is  due  to  detector  noise. 

In  Fig.  8.  we  show  the  same  plots  for  a  magnitude  12  object.  The  curvature  threshold  is  the  same  as  was  used 
for  all  other  plots  shown  here.  Here,  the  DM-only  SNR  curve  is  slightly  higher  than  the  VGP/DM  SNR  at  all 
frequencies  of  interest.  Clearly,  the  SNR  gains  using  the  VGP  are  magnitude-dependent.  However,  we  point  out  that 
with  fixed  aperture  masks,  there  are  two  free  parameters  available  to  optimize  measured  visibility  SNR^®;  namely, 
the  size  and  number  of  the  mask  openings  (“subapertures").  While  there  are  not  generally  discrete  subapertures  in 
the  V'GP  as  there  are  with  a  fixed  aperture  mask,  the  ratio  of  turbulence  noise  power  to  detector  noise  power  can 
still  be  manipulated  by  adjusting  the  wavefront  curvature  threshold  and  by  restricting  the  regions  over  which  the 
V'GP  can  allow  light  to  pass.  For  dim  objects,  larger  subapertures  are  favored  in  fixed  aperture  masks,  even  though 
the  turbulence  power  in  the  data  increases.  By  analogy,  dim  objects  should  also  drive  a  VGP  controller  to  use  a 
lower  curvature  threshold,  increasing  the  pupil  area.  Dim  objects  also  dictate  a  reduction  in  the  number  of  fixed 
subapert vires:  this  reduces  the  number  of  baselines  among  which  light  must  be  shared  (naturally,  this  reduction  in 
the  number  of  subapertures  becomes  academic  for  objects  so  dim  as  to  indicate  aperture  masking  should  not  be  used 
at  all).  The  same  effect  can  be  obtained  with  the  VGP  by  allowing  the  VGP  controller  to  mask  regions  of  the  pupil 
regardless  of  the  wavefront  curvature.  For  example,  the  VGP  could  mask  all  but  a  small  annulus  of  the  pupil,  only 
opening  for  low-curvature  wavefronts  within  the  area  of  the  annulus.  These  ideas  and  optimal  control  rules  will  be 
studied  in  future  work. 


4.  SUMMARY 

We  have  described  a  variable-geometry  pupil  (V'GP)  to  increase  image  resolution  for  ground-based  near-IR  and 
optical  imaging.  A  curvature-type  wavefront  sensor  provides  an  estimate  of  the  wavefront  curvature  to  the  controller 
of  a  high-resolution  spatial  light  modulator  (SLM)  or  micro-electromechanical  (MEM)  mirror,  which  passes  or  reflects 
the  incident  beam  only  where  the  wavefront  phase  is  sufficiently  smooth,  viz.,  where  the  curvature  is  sufficiently  low. 
Using  a  computer  simulation,  we  have  shown  the  VGP  can  sharpen  and  smooth  the  long-exposure  PSF  and  increase 
the  OTF  SNR  for  tilt-only  and  low-order  AO  systems,  allowing  higher  resolution  and  more  stable  deconvolution  with 
dimmer  AO  guidestars. 

\Ve  point  out  that  the  VGP  can  be  used  for  long-exposure  imaging,  in  contrast  with  fixed-aperture  masks,  which 
are  typically  u.sed  to  enliance  visibility  SNRs  in  speckle  imaging  schemes.  Also,  we  note  that  while  the  VGP/DM 
PSFs  shown  here  have  more  core  energy  and  smaller  halo  features  close  to  the  core,  there  is  generally  more  energy 
in  the  VGP/DM  halo  at  radii  greater  than  about  an  arcsecond  from  the  core.  This  is  due  to  the  fact  that  the  VGP 
aperture  area  is  bounded  from  above  by  the  DM-only  aperture  area,  and  is  most  always  smaller.  Additionally,  we 
note  that  where  a  curvature-sensing  WFS  is  used,  the  VGP  uses  curvature  information  directly,  rather  than  after 
integration,  meaning  the  V^GP  controller  is  never  subject  to  the  numerical  errors  that  can  occur  using  a  large  number 
of  subapertures  in  a  curvature  sensor.  Finally,  we  point  out  that  the  sharper  core  obtained  with  the  VGP  could 
increase  the  sky  coverage  of  an  AO-compensated  spectrometer*’^  or  the  sensitivity  of  a  coronographic  imager. 
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Fig.  7;  OTF  SNR  curves  for  DM-only  (solid  line)  and  VGP/DM  (dashed  line)  AO  for  a  magnitude  10 
object. 


Fig.  7;  OTF  SNR  curves  for  DM-only  (solid  line)  and  VGP/DM  (dashed  line)  AO  for  a  magnitude  12 
object. 
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ABSTRACT 

When  measured  data  are  limited  in  quality  and  in  quantity  as  they  in  general  are.  the  addi¬ 
tional  information  contained  in  a  priori  constraints  can  often  lead  to  unique  image  reconstruction 
in  a  variety  of  algorithms.  In  a  linear  deconvolution  algorithm  devised  by  Tyler  and  Matson,^  an 
a  priori  knowledge  of  the  support  of  the  object  was  exploited  to  reduce  the  atmospheric  noise  in 
the  measured  astronomical  data.  Prasad*  has  recently  showed  that  such  noise  reduction  can  be 
described  in  terms  of  a  drift-diffusion  dynamics.  In  the  present  paper,  we  consider  the  problem 
of  constrained  image  deconvolution  from  Shannon's  mutual  information  viewpoint.  We  shall 
show  that  the  noise  dynamics  induces  a  corresponding  information  dynamics  as  well,  leading  to 
the  possibility  of  assigning  an  information-theoretic  measure  to  the  usefulness  of  constraints  in 
image  processing. 


I.  Introduction 

Statistical  information,  as  defined  by  Shannon.^  refers  to  the  amount  of  information  one  gains  about 
the  input  by  measuring  the  output  of  a  communication  channel.  In  the  absence  of  noise  or  another 
limitation,  the  output  furnishes  a  complete  specification  of  the  input,  and  the  communication  can  be 
thought  of  as  being  perfect.  By  regarding  the  most  general  imaging  system  as  a  communication  channel, 
with  object  inputs  and  image  outputs,  it  is  clear  that  one  can  characterize  the  performance  of  an  imaging 
system  too  in  terms  of  the  information  it  processes. 

The  use  of  information  theory  to  characterize  image  formation  was  first  discussed  by  Fellgett  and 
Linfoof’  not  long  after  Shannon's  seminal  work.  They  addressed  two  competing  viewpoints  about  the 
quality  of  an  image.  The  first  viewpoint  is  the  obvious  one.  namely  that  the  image  which  resembles  the 
object  most  is  the  best  image.  The  second  viewpoint  is  the  information  theoretic  one.  The  best  image 
must  carry  the  most  information  about  the  object.  That  the  second  viewpoint  is  generally  superior  is  at 
once  clear  when  one  recognizes  that  often  the  object  is  not  known  a  priori  so  the  question  of  resemblance 
of  the  image  to  the  object  cannot  even  be  answered  in  such  situations.  Fellgett  and  Linfoot  considered 
some  simple  examples  of  object  ensembles  and  additive  noise  with  Gaussian  probability  distribution  in 
the  same  spirit  as  Shannon's  Gaussian  channels,  but  further  use  of  information  theory  to  describe  imaging 
had  to  wait  until  Frieden's  work  in  the  early  seventies. 
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Taking  the  lead  from  Jaynes's  interpretation®  of  statistical  mechanics  from  the  information  theoretic 
viewpoint.  Frieden®  introduced  the  notion  of  maximum  entropy  to  characterize  the  least  biased  object 
intensity  distribution  0(x)  that  could  have  given  rise  to  the  measured  image  data.  The  maximum 
entropy  method  (MEM)  has  slowly  risen  in  importance  to  become  one  of  the  most  commonly  used 
algorithms  in  modern  image  processing.  A  somewhat  different  method,  based  on  maximizing  the  mutual 
information  rather  than  the  entropy,  was  also  proposed  some  years  later  by  Frieden.^  who  compared  the 
two  approaches.  In  both  approaches  the  intensity  at  pixel  x  is  taken  to  be  proportional  to  a  probability 
function  p(x)  and  statistical  considerations  concern  a  single  object  and  a  single  image,  rather  than 
statistical  ensembles  of  objects  arid  images. 

A  pnon  constraints  like  positivity  and  a  knowledge  of  object  support  can  provide  further  information 
on  the  object  and  constrain  the  reconstruction  of  the  object  from  limited  data.  It  has  been  shown  by 
Tyler  and  Matson^  that  the  imposition  of  the  support  constraint  not  only  limits  the  physical  extent  of 
the  reconstructed  object  in  a  trivial  manner  but  it  also  enables  a  reduction  of  noise  inside  the  support, 
thereby  improving  the  quality  of  the  measured  data.  This  noise  reduction  algorithm  was  analyzed  by 
Prasad"  who  modeled  the  noise  transport  in  this  algorithm  as  a  drift-diffusion  process  in  the  spatial 
frequency  plane. 

In  the  present  paper,  we  analyze  the  dynamics  of  mutual  information  throughput  of  an  imaging 
system,  in  much  the  same  spirit  as  Fellgett  and  Linfoot,"*  without  and  with  the  application  of  an  a  priori 
knowledge  of  the  object  support.  The  drift-diffusion  dynamics  of  Fourier  noise  induces  an  analogous 
dynamics  of  mutual  information  through  the  imaging  system  on  imposition  of  the  support  constraint. 
This  will  be  the  main  focus  of  the  present  work. 


11.  Noise  Dynaiiiics  under  Support  Constraint 


The  problem  of  interest  here  is  an  astronomical  imaging  problem,  in  which  a  ground-based  telescope 
form.s  the  image  of  an  object  in  the  sky.  With  no  or  incomplete  adaptive  correction  of  the  wavefront  errors 
caii.sed  by  atmospheric  turbulence,  the  raw  image  formed  by  the  telescope  is  correspondingly  noisy  and 
poorly  resolved  even  under  the  best  seeing  conditions.  The  image  formation  is  most  simply  described  in 
the  Fourier  (or  spatial-frequency)  domain,  where  the  telescope  and  the  atmospheric  turbulence  combine 
to  form  a  spatial  filter  with  a  noisy  multiplicative  optical  transfer  function  (OTF)  H(u)  as  a  function  of 
the  spatial  frequency  u.  The  Fourier  transform  of  the  (fluctuating)  image  intensity  is  related  to  that  of 
the  object  intensity  as 

/(«)  =  0(x!)H(n).  (2) 

We  assume  that  there  are  no  other  sources  of  image  noise,  so  the  multiplicative  noise  arising  from  the 
OTF  H .  noise  that  is  evidently  non-stationary.  is  the  only  one  we  consider  here. 


For  a  circular  telescope  pupil  of  large  diameter,  the  wavefront  is  composed  of  many  statistically 
independent  patches,  as  is  the  OTF.  Therefore,  by  the  central  limit  theorem,  the  statistics  of  the  OTF 
H(u)  is  described  approximately  by  a  Gaussian  probability  density  function  (pdf),  which  is  also  circular 
about  the  mean  value: 


najjiv) 


(3) 
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in  which  cr^{v)  is  the  variance  {\H{u)  —  (i/(i7))p). 

The  restriction  of  object  support  is  obviously  a  restriction  of  the  physical  domain.  Let  s(x)  be  the 
support  function,  taking  the  value  1  inside  the  support  and  0  outside.  Any  intensity  that  is  outside  the 
support  can  only  be  noise,  which  can  then  be  set  to  0  enabling  thus  a  trivial  reduction  of  noise.  The 
support-constrained  image  intensity  i,{x)  at  location  x  is  thus  just  the  raw  image  intensity  i{x)  at  that 
location  times  s(£).  In  the  Fourier  domain  this  becomes  a  convolution, 

/,(«)  =  y/(t7-tr)5(t/')dt7'.  (4) 

By  subsituting  expression  (2)  for  the  raw  image  spectrum*  /(u)  into  Eq.  (4),  we  see  that  the  noisy  OTF 
H{v)  induces  a  noise  in  the  support-constrained  image  spectrum  via  a  convolution. 

When  uncompensated  turbulence-induced  errors  are  significant,  we  can  show-  that  the  noise  in 
the  real  and  imaginary  parts  of  the  unconstrained  and  constrained  image  spectra  /({?)  and  Is(u)  are 
uncorrelated.  The  sum  of  the  variances  in  the  real  and  imaginary  parts  of  the  two  spectra,  on  the  other 
hand,  are  approximately  related  by  the  convolution 

A,(t7)  =  j  (5) 

where  A’(u)  =  {|/(w)  —  (f(w))|*)  is  that  variance  sum  in  the  unconstrained  image,  N,  is  the  corresponding 
sum  in  the  constrained  image.  D^(Xfui)  is  the  structure  function  of  the  uncompensated  phase  errors  at 
wavelength  A  of  the  imaging  light,  and  /  is  the  focal  length  of  the  telescope.  Note  from  Eq.  (2)  that 

N(u)  =  |0(t7)|V^(t/).  (6) 

By  expanding  the  noise  N(u-ui)  as  a  power  series  in  uj.  a  procedure  that  is  justified  for  sufficiently 
narrow  S(i7i)  (equivalently,  for  sufficiently  extended  support  size),  we  can  express  the  change  in  the  noise 
power  spectrum  under  the  application  of  support,  i.e..  AN(u)  —  N,(ii)  —  N(u).  to  second  order  as 

AN(v)  =  B  ■'VN{i!)  +  ^D  :VVN(v).  (7) 

The  vector  B  and  the  second-rank  tensor  D.  given  by 

B  =  -yd-u5(u)e-“/2)0*(A/«)j  (gg) 

and 

b-  I  (86) 

describe  a  decomposition  of  the  noise  transport  process  into  a  drift  and  a  diffusion  component,  respec¬ 
tively. 

It  is  worth  noting  that  the  tensor  D  can  always  be  diagonalized  by  a  principal  axis  transformation, 
with  the  eigenvalues  characterizing  the  two  principal  diffusion  constants  of  the  problem.  The  drift  vector 
B  represents,  on  the  other  hand,  a  ballistic  transport  of  noise  in  the  spatial  frequency  plane  and  is  driven 
by  the  noise  gradients  in  that  plane.  Note  that  the  drift  vector  (8a)  vanishes  for  any  inversion-symmetric 
support,  e.g..  for  a  circular  centered  support,  in  which  case  the  noise  transport  is  purely  diffusive.  A 
more  detailed  discussion  of  the  properties  of  noise  transport  and  how  that  can  be  used  to  reduce  the  noise 
inside  the  support  via  the  algorithm  of  Ref.  1  is  presented  in  Ref.  2.  We  turn  now  to  an  analysis  of  the 
information  dynamics  that  the  preceding  drift-diffusion  dynamics  of  noise  induces. 


*  Throughout  the  paper  we  mean,  by  the  phrase  “spectrum.’' 
variable  being  considered. 


the  spatial  frequency  dependence  of  the 
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III.  Information  Dyn^unics  under  Support  Constraint 


,  Consider  the  mutual  informafTon  l(u),  which  is  the  information  successfully  transmitted  by  an 
imaging  system,  at  a  given  location  v  in  the  spatial  frequency  plane: 

2(w)  ='Ho(«)-’Ho|/(«) 

(9) 

The  quantities  'Ho  -  Hi-  Ho]!-  and  'Hi\o  are  the  unconditional  and  conditional  information  entropies  at 
the  spatial  frequency  u  in  the  statistical  object  and  image  ensembles,  and  are  defined  by  the  relations, 

'Ho(u)  =  -j  d^O(v)Po(0(il))\ogPo(0(v)y,  (10a) 

?fo|/(J)  =  - 1  d^O(il)d^I(u)P(0{u),  /(«))  log  Po\i{0(v)\I(v)):  (106) 

and  the  relations  obtained  from  these  by  interchanging  I  and  O.  The  quantity  Pq  is  the  (unconditional) 
pdf  for  the  object  spectral  amplitude  0(u):  Po\i  is  the  conditional  pdf  for  the  object  spectral  amplitude 
0(i7).  conditioned  on  the  premise  that  a  particular  image  spectral  amplitude  value  /(u)  has  been  detected; 
and  P(0(i7). /(«))  is  the  joint  pdf.  These  unconditional  and  conditional  pdf’s  may  also  be  regarded  as 
pre  and  post  detection  pdf’s  for  the  object  spectrum  0(«). 

Before  proceeding  further,  we  note  that  we  are  led  by  the  spirit  of  Shannon’s  information  theory  to 
deal  with  object  and  image  ensembles,  rather  than  single  objects  or  images.  The  information  theoretic 
treatment  will  thus  address  a  class  of  objects,  and  is  naturally  disposed  toward  providing  design  and 
other  evaluational  considerations  for  an  imaging  system  that  may  use  a  particular  algorithm  on  a  variety 
of  objects. 

Tlie  two  alternative  expressions  for  the  mutual  information  in  Eq.  (9)  convey  the  input-output 
symmetry  of  the  information  measure.  When  the  prior  knowledge  of  the  object  support  is  used,  the 
image  spectrum  changes  from  Eq.  (2)  to  the  convolution  (4).  Such  a  change  obviously  causes  no  change 
in  the  object  pdf  Po  and  therefore  in  the  object  information  7io(d)-  Thus  the  change  AZ(u)  in  the 
mutual  information  when  the  support  constraint  is  applied  may  be  written  as 

AI(u)  =  -  A7{o|/(«) 

=A?f/(t7)-A7f/,o(u).  (11) 

.Although  formally  the  change  of  I  is  more  simply  related  to  the  conditional  information  'Ho]!-  the 
computation  of  the  conditional  pdf  f*o|;  >s  generally  harder  than  that  of  the  “inverse”  conditional  pdf 
Pi\o  that  goes  into  the  evaluation  ofHi\o-  Indeed,  both  for  the  unconstrained  and  constrained  image 
spectra  (2)  and  (4).  fixing  the  object  spectrum  0(w)  leads  to  conditional  variates  I  and  I,  that  are 
linearly  dependent  on  the  Gaussian  variate  H(v).  as  given  by  its  pdf  (3),  and  thus  are  themselves 
Gaussian  variates.  Their  pdf's  are 

P;|o(/(t7)|0(«))  =  (12a) 

7rjV(i/) 
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and 


(126) 


^  -|/(g)-(J.(g))P/JV.(g) 

rN,(v) 


where  the  superscript  (s)  refers  to  the  support-constrained  case  and  the  symbols  N(v)  and  N,(u)  refer 
to  the  image  noise  power  spectral  density  before  and  after  the  support  constraint  is  applied,  for  a  fixed 
object  spectrum  0{u).  They  are  the  same  as  the  N{u)  and  Ng(u)  defined  in  Sec.  II  and  related  by 
Eqs.  (5)  and  (6).  The  conditional  mean  values  (/(u))  and  (/»(«))  are  given  by  averaging  Eqs.  (2)  and  (4) 
over  the  fluctuations  of  H  for  fixed  0(v). 


The  pdf's  for  the  image  spectrum,  namely  P/(/{u))  and  pj‘\ls(u).  are  given  by  first  multiplying 
Eqs.  (12)  by  the  object-spectrum  probability  density  functional  (pdF)  Po[0{u)]  (rather  than  the  pdf  at 
a  single  spatial  frequency  since  all  spatial  frequencies  occur  in  the  convolution  (5))  and  then  integrating 
over  the  entire  object  ensemble.  From  the  so  constructed  image-spectrum  pdf,  one  then  has  to  use  the 
formula  (10a).  with  O  replaced  by  I.  to  compute  both  the  unconstrained  information  ?f/(n)  and  the 
constrained  one  and  the  change 


We  have  outlined  a  rather  ambitious  procedure  of  computing  the  change  in  the  mutual  information 
when  support  constraint  is  applied.  Analytically,  this  is  a  prohibitive,  if  not  impossible,  task.  We  therefore 
introduce  a  simplifying  assumption,  namely  that  the  change  in  the  unconditional  image  information. 
'Hi(tl).  is  negligible  compared  to  the  that  in  the  conditional  information  ?f/|o(w)-  The  justification  for 
this  assumption  is  that  the  pdf  for  the  image  spectrum  results  from  an  averaging  of  the  conditional  image 
pdf  P/|o(tf)  over  the  statistical  variations  in  the  object  ensemble.  Such  a  statistical  averaging  tends 
to  undo  the  frequency  mixing  caused  by  the  convolution  (4),  since  it  makes  the  typical  image  spectral 
distribution  in  the  ensmeble  of  such  distributions  have  a  more  uniform  character  in  frequency. 


With  this  assumption,  the  change  in  mutual  information  is  approximately  the  negative  of  the  change 
in  the  conditional  information 

AI(w)  fts  -A-H/|o(u).  (13) 

The  change  in  the  latter  can  be  analytically  computed,  given  the  Gaussian  pdf's  (12)  for  the  conditionals. 
Since 

?f/|o(w)  =  -{logF/|o(/(«)|0(i7))) 

and  since 

(|;(ii)-(/(t7))p)  =  A(i7). 

(and  similarly  for  the  support-constrained  image)  it  follows  that 


A-H,lo(i;)  =  (log  [^]  )  . 

wliere  tlie  averaging  is  over  the  statistical  variations  in  the  object  ensemble. 


(14) 


By  performing  the  series  expansion  (5).  dividing  it  by  N(v).  and  noting  that 

^  V  A  =  V  log  N.  =  W  log  A  +  V  log  A  V  log  A, 

we  can  express  the  change  AHi\o  as 

A?f;|o(t7)  ■  VW/,0  +  :  VV?f/|o 

-I-  ^b  :  ^Vlog  AVlog  a)  -  ((5  •  Vlog  A)^^  (15) 

consistent  to  second  order.  As  for  the  noise,  the  transport  of  conditional  information  proceeds  by  means 
of  ballistic  motion  (drift)  as  well  as  diffusion  in  the  spatial  frequency  plane.  However,  unlike  the  noise 
transport,  even  for  an  inversion-symmetric  support  for  which  B  vanishes,  the  transport  of  information 
remains  sensitive  to  the  noise  gradients  via  the  third  term  on  the  right  hand  side  of  Eq.  (15). 
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V,  Concluding  Remarks 


We  have  explored  in  this  paper  the  subject  of  transport  of  Shannon's  information  in  an  imaging 
system  when  the  prior  knowledge  of  object  support  is  employed.  Understanding  this  transport  is  critical 
to  divising  an  algorithm  by  which  one  can  imporve  the  information  throughput  of  the  imaging  system. 

A  related  but  much  deeper  question  concerns  a  quantitative  measure  of  information  that  one  can 
associate  with  the  a  priori  knowledge  modalities  like  support.  It  seems  intuitively  obvious  that  a  recon¬ 
structed  image  cannot  possess  more  information  than  contained  in  the  imposed  a  priori  constraints  and 
the  raw  measured  data  used  in  the  reconstruction: 

I[R]<I[C]  +  I[Ml  (16) 

where  /[/?].  I[C\.  and  I[M]  are  the  information  measures  of  reconstructed  image,  imposed  constraints, 
and  measured  data,  respectively.  But  what  is  /[C]?  Even  if  one  could  define  it,  it  will,  in  general,  depend 
on  both  the  object  statistics  and  the  algorithm  used  in  the  reconstruction.  It  is  clear,  however,  that  for 
a  given  algorithm,  one  could  pose  an  optimization  question:  For  what  object  set  is  the  information  I[C\ 
in  the  imposed  constraints  maximized?  Considerations  analogous  to  those  that  address  the  capacity  of 
communication  channels  must  be  employed  to  answer  this  question. 

This  work  was  partially  supported  by  the  Air  Force  Office  of  Scientific  Research  under  Contract  No. 
F49620-97-1-0148. 
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